Here, I produce some of the figures for the paper.

Data from shared simulations

Recall RMSE result:

 Inference Method MASH    ASH  eqtlBMA-lite
RMSE_S            0.010 0.030   0.047
RMSE_TS           0.008 0.025   0.043
cor.with.truthS   0.99  0.94    0.84
cor.with.truthT S 0.99  0.94    0.82

And here we plot number sig vs significance threshold:

Data from tissue.specific.shared simulations

And here we plot number sig vs significance threshold:

Now show the scatterplots demonstrating the tissue.specific scenario. Recall that the purpose is to simulate according to the patterns of sharing present in the GTEX V6 Data, where 100 snps in cis of 500 genes, \(\pi_0\) is 0.80, and there is sharing according to gtex covmats 2:9 and about 35% tissues specific.We can also plot the scatterplot to observe the behavior of ‘null’(in red) and ‘real’(in blue) associations. We see that our joint analysis reduces the Root Mean Squared Error (RMSE) and provides ample shrinkage of ‘null values’.

alt text alt text alt text alt text

REAL DATA RESULTS

Now, we would like to do this for the real data, plotting the posterior mean vs the input summary statistic using both ash and MASH.

alt text alt text

Recall Power Results:

Metric                    LFSR_MatrixAsh LFSR_ASH   LFSR_eQTL-BMALite
Significant b_{jr} ≤ 0.05 393,414           91,755 401,552

Now, we would like to compare the pi.matrices to see from where the prior weight is arising. Here, we demonstrate the ‘parsing’.

And here, I wanted to demonstrate several ‘tissue’ by ’tissue plots, where eQTL are grouped by their loading. We see some tissue specific activity, some highly correlated (e.g., both colons), and tissue pairs in which SNPS are active in both (skeletal muscle and brain) but the ratios differ among groups.

Now, I want to give a gross overview of all the patterns of correlations. Here, I plot heatmaps of all 8 learned matrices:

compiledheatmaps.png.

According to the text, I give examples of \(U_{k}=3\), \(U_{k}=9\) and eQTLBMA lite config (for qualitative specificity). In the example for 3, strong, shared effects in brain tissues match an 349 underlying pattern of shared effects present in the larger data set and thus allows this 350 gene-snp pair to find its true match. Brain effect sizes thus borrow strength from one 351 another, and accordingly, the posterior estimates tend to nudge the brains towards a onsistent, shared effect. Similarly, an overall tendency towards consistency in sign in the larger data set, as captured by the hierarchical model and reflected in the positive correlation in sign among all tissues, tends to ‘flip’ erratic off directions towards the prevailing positive direction. Heterogeneity in magnitude among the other tissues is 356 reflected in the variety of banding intensity along the diagonal.

In this example, though the particular pattern featured (Uk = 9) captures correlation in sign among all tissues, significant quantitative heterogeneity is again reflected in the intensity of the banding along the diagonal, in this dramatically dichotomous between testes/whole blood and all other tissues. Here, we introduce the idea of quantitative specificity - e.g., that a SNP can be modestly ‘active’ in all tissues though to dramatically different degrees. here, though this matrix was learned (and not forced, as in eQTLBMA-lite) from the data, the pattern of quantitative tissue specificity in testes and whole blood is evident. Again, erratic, off-directions are flipped in sign. We refer to this as quantitative specificity, because the effects are quantitatively unique toparticular tissues - e.g., significantly larger in magnitude in testis than all other tissues - and yet considered non-zero in all tissues. This is in contrast to qualitative specificity, described below, in which we would conclude that the QTL is active in only one tissue.

Lastly, the inclusion of the eQTLBMA lite configurations (in which the SNP has a 378 non-zero effect in only one tissue) coupled with the learned patterns of tissue specificity 379 evident in matrices Uk : 5 − 9 serve to allow the preservation of qualitatively specific 380 effects. Here, we show a gene-snp pair demonstrating high loading on one of the 381 eqtlbma-lite configuration matrices - indeed, we reject the significance of the effect size 382 estimates in all tissues but testes, a pattern consistent with the presence of 383 tissue-specificity described below. Together, these results cement the resolution afforded 384 by methods which can distinguish among tissues in which a QTL is called active, 385 beyond reducing genetic effects to binary ‘on’ or ‘off’ conclusions

Now we discuss Tissue specificity

Furthermore, tissue specific eQTL demonstrate the smoothing feature of this joint shrinkage approach.

First, we plot the ‘singletons’ of the observed noisy values.

Now, we plot the matrix-ash shrunken \(E(Z|Data)\).

Heterogeneity Analysis

First, we asked in how many tissues is a QTL signficiant.

But now, we can ask much more interesting questions.

                                              All  No Brains
$E(ConsistentSignPosteriorMean | LFSR≤0.05)$  0.852 0.802
E(At least 50% max value)                     0.354 0.449
Normalized B_jr >0                            0.833 0.879

Heterogeneity in sign:

And we consider heterogeneity in magnitude:

Some additional quantities we may want to consider:

Now, let’s consider the number of tissues vs normalziing value. We plot the median of the value used to normalize for each group of SNPs that are of the same number of tissue class. For each ‘s’ class, grab QTL that have that many tissues > max effect, find the effect used to normalize them, and take median of those s class maxes. We can see that the ‘brain homogenous effects’ (i.e., tissues in which the brain was chosen as the tissue of max effect) seem to have a lower median max effect, indicating that when brain is the highest, the other effects tend to be small.

How do the tissues with high or low values of our heterogeneity index behave?

Let’s looks at how heterogeneity behaves. Here, we plot examples in which the vast majority of tissues have a heterogeneity index greater than 50%. We plot the E(Z|D) for those in which the QTL were ‘homogeneous’ in most:

Now, we look at eQTL in which the effects are consistent in only a few:

From this comparison we can also see that the non-homogenous eQTL tned to have smaller max effects.

Lastly, we can consider the biplot which plots the Posterior mean vs the Value used to normalize it, to consider if large effects occur in the presence of similar size effects (e.g., normalized effect sizes close to one)

alt text